A Repeated Imitation Model with Dependence Between Stages: Decision Strategies and Rewards

Authors

  • Pablo J. Villacorta
  • David A. Pelta
Abstract

Adversarial decision making is aimed at determining strategies to anticipate the behavior of an opponent trying to learn from our actions. One defense is to make decisions intended to confuse the opponent, although this can diminish our rewards. This idea has already been captured in an adversarial model introduced in a previous work, in which two agents separately issue responses to an unknown sequence of external inputs. Each agent’s reward depends on the current input and the responses of both agents. In this contribution, (a) we extend the original model by establishing stochastic dependence between an agent’s responses and the next input of the sequence, and (b) we study the design of time-varying decision strategies for the extended model. The strategies obtained are compared against static strategies from theoretical and empirical points of view. The results show that time-varying strategies outperform static ones.
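To make the setting in the abstract concrete, the following is a minimal simulation sketch, not the authors' implementation. It assumes two inputs and two actions, an imitator that guesses in proportion to the actions it has observed for the current input, a reward that is earned only when the guess is wrong, and a next input drawn from a distribution that depends on the action just taken. The payoff values, the transition probabilities, and every name (`PAYOFF`, `NEXT_INPUT`, `simulate`, `static_strategy`, `alternating_strategy`) are hypothetical and chosen only for illustration.

```python
import random

# Hypothetical numbers for illustration only; none are taken from the paper.
PAYOFF = [[1.0, 0.6], [0.5, 1.0]]      # PAYOFF[input][action]
NEXT_INPUT = [[0.8, 0.2], [0.3, 0.7]]  # NEXT_INPUT[action][next_input]


def simulate(strategy, steps, seed=0):
    """Return the total reward collected by the agent being imitated."""
    rng = random.Random(seed)
    # Imitator's per-input counts of observed actions (start at 1 so
    # every action has a positive sampling weight).
    counts = [[1, 1], [1, 1]]
    current = 0
    total = 0.0
    for t in range(steps):
        probs = strategy(t, current)
        action = rng.choices([0, 1], weights=probs)[0]
        # Assumed imitator: guess in proportion to past observations.
        guess = rng.choices([0, 1], weights=counts[current])[0]
        if guess != action:  # assumed rule: reward only if not guessed
            total += PAYOFF[current][action]
        counts[current][action] += 1
        # Dependence between stages: the action shapes the next input.
        current = rng.choices([0, 1], weights=NEXT_INPUT[action])[0]
    return total


def static_strategy(t, current):
    """Same action distribution at every step."""
    return [0.5, 0.5]


def alternating_strategy(t, current):
    """A simple time-varying strategy: switch bias every 10 steps."""
    return [0.9, 0.1] if (t // 10) % 2 == 0 else [0.1, 0.9]
```

Calling `simulate(static_strategy, 1000)` and `simulate(alternating_strategy, 1000)` gives one way to compare a static and a time-varying strategy under these assumed rules; the outcome depends entirely on the made-up numbers above and says nothing about the paper's results.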


Similar resources

Asymptotically Efficient Adaptive Strategies in Repeated Games Part I: Certainty Equivalence Strategies

This paper addresses the problem of dynamic decision making in an uncertain and competitive environment. A decision maker (player 1) faces a system about which he has some (parametric) uncertainty, and which is affected also by the actions of other agents. We focus on a worst-case analysis from the viewpoint of player 1, using the simplified model of a repeated matrix game with lack of informat...


Learning Imitation Strategies Using Cost-based Policy Mapping and Task Rewards

Learning by imitation represents a powerful approach for efficient learning and low-overhead programming. An important part of the imitation process is the mapping of observations to an executable control strategy. This is particularly important if the capabilities of the imitating and the demonstrating agent differ significantly. This paper presents an approach that addresses this problem by o...


Designing a Model for Managing the Ethical and Job Attitudes of Employees in Government Agencies

Background: The purpose of this study is to design a model for managing the ethical and professional attitudes of employees in government organizations. Method: This is a mixed-methods (qualitative-quantitative) study. In the qualitative part, the strategy used was grounded theory. The statistical population consisted of university professor...


The Effect of Reciprocal Imitation Training on the Social Skills of Children with Autism

Objective: The present research aimed to determine the effect of reciprocal imitation training on the social skills of children with autism. Materials & Methods: This was a quasi-experimental study with repeated measures. Fourteen children aged 5 to 7 years with high-functioning autism (3 girls and 11 boys) were selected by convenience sampling from those referred to a private clinic in Tehran in 2012-20...


Dynamic vs. Static Decision Strategies in Adversarial Reasoning

Adversarial decision making is aimed at determining optimal decision strategies to deal with an adversarial and adaptive opponent. One defense against this adversary is to make decisions that are intended to confuse him, although our rewards can be diminished. It is assumed that making decisions in an uncertain environment is a hard task. However, this situation is of utmost interest in the cas...


Journal title:
  • Applied Mathematics and Computer Science

Volume 25

Publication date: 2015